Trading Accuracy for Size: Online Small SVMs via Linear Independence in the Feature Space
نویسندگان
چکیده
Support Vector Machines (SVMs) are a machine learning method rooted in statistical learning theory. One of their most interesting characteristics is that the solution achieved during training is sparse, meaning that a few samples are usually considered “important” by the algorithm (the so-called support vectors) and give account of most of the complexity of the classification/regression task. It is then crucial to SVMs to keep the number of support vectors as small as possible, since both the training and testing time depend on the size of the training set and the number of support vectors. In fact, in recent literature this has become a key issue in order to speed up SVMs without losing accuracy. In this paper we propose Online Independent Support Vector Machines, an incremental way of reducing the number of support vectors, based upon linear independence in the feature space. Experiments reveal that, at least on the standard tests we have chosen, our machines achieve a dramatic reduction in the number of support vectors without losing accuracy, especially when finite-dimensional kernels are used.
منابع مشابه
تعیین ماشینهای بردار پشتیبان بهینه در طبقهبندی تصاویر فرا طیفی بر مبنای الگوریتم ژنتیک
Hyper spectral remote sensing imagery, due to its rich source of spectral information provides an efficient tool for ground classifications in complex geographical areas with similar classes. Referring to robustness of Support Vector Machines (SVMs) in high dimensional space, they are efficient tool for classification of hyper spectral imagery. However, there are two optimization issues which s...
متن کاملOnline Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کاملSupervised Feature Extraction of Face Images for Improvement of Recognition Accuracy
Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...
متن کاملIndoor Place Recognition using Online Independent Support Vector Machines
In the framework of indoor mobile robotics, place recognition is a challenging task, where it is crucial that self-localization be enforced precisely, notwithstanding the changing conditions of illumination, objects being shifted around and/or people affecting the appearance of the scene. In this scenario online learning seems the main way out, thanks to the possibility of adapting to changes i...
متن کاملFeature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach
Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...
متن کامل